Predicting 3D Tongue Shapes from Midsagittal Contours
نویسندگان
چکیده
This study is interested in whether there exists a predictable relationship between the midsagittal tongue contour and its related 3D tongue surface shape during speech. The assumption is that for any single language, a limited set of phonemically based 3D tongue shapes are used. If these shapes can be delineated and mapped to specific midsagittal displacements and cross-sectional (coronal) shapes, then predictions from midsagittal displacements to coronal shapes and then to 3D shapes can be made for specific speech sounds. The present study examined two ultrasound data sets: (1) the 3D static tongue surface reconstructions from a single subject (Stone & Lundberg, 1996), and (2) five coronal slices for two sentences spoken by a second subject (Yang & Stone, 2002). The midsagittal-to-coronal relationship for the 3D surfaces was extracted and applied to the continuous speech data. The predicted midsagittalto-coronal relationship was well captured. This result supports the idea that a knowledge of the 3D shapes of a language, even based on a single speaker, can then be used to transform 2D midsagittal data into a 3D surface for other data sets. INTRODUCTION AND BACKGROUND Any single language has a finite number of lingual phonemes and thus a limited set of 3D static tongue surface shapes. An inventory of these 3D tongue surface shapes can be made for all the lingual phonemes of the language. Each 3D surface can be deconstructed into a “chain” of 2D coronal 18 CRC_SP-Tabain_CH018.qxd 12/15/2005 12:35 PM Page 315
منابع مشابه
Predicting tongue shapes from a few landmark locations
We present a method for predicting the midsagittal tongue contour from the locations of a few landmarks (metal pellets) on the tongue surface, as used in articulatory databases such as MOCHA and the Wisconsin XRDB. Our method learns a mapping using ground-truth tongue contours derived from ultrasound data and drastically improves over spline interpolation. We also determine the optimal location...
متن کاملThree-dimensional tongue surface reconstruction: practical considerations for ultrasound data.
This paper discusses methods for reconstructing the tongue from sparse data sets. Sixty ultrasound slices already have been used to reconstruct three-dimensional (3D) tongue surface shapes [Stone and Lundberg, J. Acoust. Soc. Am. 99, 3728-3737 (1996)]. To reconstruct 3D surfaces, particularly in motion, collecting 60 slices would be impractical, and possibly unnecessary. The goal of this study ...
متن کاملReconstructing the tongue surface from six cross-sectional contours: ultrasound data
This work presents a method for reconstructing 3D tongue surfaces during speech from ultrasound data. The method reduces the dimensionality of the tongue surface and maintains highly accurate reproduction of local deformation features. This modification is an essential step if multiplane tongue movements are to be reconstructed practically into tongue surface movements. Earlier work (Stone & Lu...
متن کاملFrom real-time MRI to 3d tongue movements
Real-time Magnetic Resonance Imaging (MRI) at 9 images/s of the midsagittal plane is used as input to a threedimensional tongue model, previously generated based on sustained articulations imaged with static MRI. The aim is two-fold, firstly to use articulatory inversion to extrapolate the midsagittal tongue movements to three-dimensional movements, secondly to determine the accuracy of the ton...
متن کاملLinear degrees of freedom in speech production: analysis of cineradio- and labio-film data and articulatory-acoustic modeling.
The following contribution addresses several issues concerning speech degrees of freedom in French oral vowels, stop, and fricative consonants based on an analysis of tongue and lip shapes extracted from cineradio- and labio-films. The midsagittal tongue shapes have been submitted to a linear decomposition where some of the loading factors were selected such as jaw and larynx position while fou...
متن کامل